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Abstract: Given two unsorted lists each of length N that have a single common entry, a quantum 
computer can find that matching element with a work factor of 0(N 3 ^ 4 log N) (measured in 
quantum memory accesses and accesses to each list). The amount of quantum memory required 
is O^N 1 / 2 ). The quantum algorithm that accomplishes this consists of an inner Grover search 
combined with a partial sort all sitting inside of an outer Grover search. 
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Given two databases, each containing iV entries, an important question is often to 
locate a common entry in both databases, if one exists. If the databases are unstructured, 
then this can be a hard problem. This note gives a quantum mechanical algorithm for 
solving this matching problem in time O (iV 3 / 4 log iV) , which is better than the best known 
classical algorithm for this problem. 

Although this bears some superficial resemblance to the collision problem, for which 
a quantum mechanical solution was given in [BHT], the question being addressed here 
actually is quite different. In the collision problem, stated as a database problem, it was 
assumed a single database contained every entry exactly twice, and the goal was to find a 
single example of a duplicate entry. Here, there are two databases and it is assumed that 
there is exactly one entry shared by both of them, and the problem is to locate that one 
entry in both databases. 

The solution here, in common with the solution to the collision problem given in 
[BHT], uses Grover's search algorithm as a subroutine, and all of the quantum mechanics 
is relegated to what goes on inside of Grover's algorithm. All that needs to be understood 
for using Grover's algorithm is what its resource requirements are. Another common 
feature of the solution here with the collision problem is the use of a sorting subroutine 
on a sublist and the fact that a desired item can be located quickly in a sorted list, a 
procedure often referred to as "insertion into a sorted list" . 

The cost of running an algorithm is equivalent to the time it takes to run. In the 
model here, static memory does not add to the cost, but reads and writes from and to 
memory do count. In addition, access to the databases also count towards the cost. Since 
all cost estimates in this note are done up to a constant factor, all that really needs to be 
counted are the number of internal memory accesses for the algorithm and the number of 
databases queries that the algorithm makes. 
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The Classical Solution to the Matching Problem 

It may be useful to begin with an exact statement of the problem to be solved: There 
are two lists L\ and L 2 each of length N, and there is a element Iq common to both L\ 
and L 2 . The object is to find that common element. 

An exhaustive solution to the matching problem would be simply to check all possible 
pairs, where the first element of each pair comes from L\ and the second element of each 
pair comes from L 2 . Since there are iV 2 such pairs, the total amount of work is (3(iV 2 ), 
where the implied constant contains the work of accessing the pair of elements from the 
two lists and the work in comparing the two elements to see if they match. 

There is a much better classical solution, however. Sort the first list Li and then run 
down all the elements of L 2 and check to see which one occurs in sorted (Li). The cost 
of sorting L\ using an efficient classical sorting algorithm is O(iVlogiV). Such sorting 
algorithms are discussed in detail in [K] , [NR] and [S] , and can even be done in place. The 
constant in the notation here includes the cost of accessing two elements from Li, the cost 
of comparing these two elements, and the cost of swapping these two elements. To check a 
single element of L 2 against sorted (Li) is the problem of insertion into a sorted list, and 
using a standard divide-and-conquer strategy, can be accomplished with cost O (log A/"), 
where the constant here absorbs the cost of accessing an element from sorted (Li) and 
the cost of comparing this element to the element of L 2 being tested. Since there are N 
elements of L 2 , the entire cost of searching through all of L 2 is (9(iVlog A/"), and the entire 
cost of the algorithm is therefore O(iVlogiV). 

Of course, it is also possible to sort L 2 instead and then search through L\ for the 
match. Alternatively, it is possible to sort both L x and L 2 producing two sorted lists, 
sorted (Li) and sorted (L 2 ) with cost 0(iV log iV) and then go down both sorted lists with 
a pair of pointers looking for a match. The cost of this last step is O (AT) , so the total cost 
is still O(NlogN). 

Grover's Algorithm for Quantum Searching 

In a very interesting and widely referenced paper [Gr], Lov Grover has shown that 
searching an unordered list can be done much faster on a quantum computer than on a 
classical computer. Classically for a list of size N, searching for a single item will cost 
O(N) accesses to the list on average. Grover's quantum algorithm only costs 0(y/N) 
accesses to the list. A good analysis of Grover's algorithm can be found in [BBHT]. It has 
been shown in [Z] that with no other structure present in the list, the best than can be 
done with a quantum algorithm is 0(VrV). 

Use of the Grover search algorithm as a subroutine is what really constitutes the 
quantum part of the new algorithm. It isn't absolutely necessary to go into the specifics of 
how Grover's algorithm works in detail to understand how it is used, but it is important 
to state what Grover's algorithm actually does and the cost associated with invoking it as 
a subroutine. 

Abstractly, Grover's algorithm goes as follows: Start with some arbitrary function 
F : X — > Y between finite sets having no a priori structure. Given some yo G Y, the object 
is to find an x G X with F(x) = yo : provided such an x exists. If k = \ {x G X | F(x) = yo}\ 
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denotes the number of different solutions, then Grover's algorithm gives finds such a solu- 
tion (provided that one exists) after an expected number of 0(a/|A|//c) evaluations of the 
function F (which must be implemented quantum mechanically). Although this doesn't 
give an exponential speedup, it is considerably better than exhaustively searching through 
all of X. In the present application, k = 1 since there will be (at most) one possible 
solution. In the usual formulation of Grover's algorithm \X\ is a power of 2, and for the 
application here, this is not a serious restriction. 

The usual way that Grover's algorithm is invoked is to compute an indicator function 
XF, yo defined by XF, yo (yo) = 1 if F(x) ^ y and XF, yo (x) = for all x with F(x) ^ y . In 
order to compute XF,y (x), the the value of F{x) will need to be computed, and this has 
to be done quantum mechanically. Although F{x) may be described classically, it is not 
necessary that F(x) be computed classically through reversible classical logic, and if there 
is a quantum algorithm that computes F(x) more efficiently, then it is perfectly acceptable 
to call that algorithm as a subroutine from Grover's algorithm. 

In evaluating the cost of Grover's algorithm, both the size of the search space and 
the cost of evaluating F enter. Often the discussion of Grover's algorithm ignores the cost 
of computing XF, yo 0^\X\/k) times, but as will be seen, this cost really can enter into 
the analysis in a significant way, and this is because F itself can be a very complicated 
function that may in fact embody various search and sort algorithms. Thus in the case 
that there is at most one solution x to F(x) = y , the cost of finding this solution (and 
checking that it is a solution) is 0{y^\X\ Cp) where Cp is the cost of a single evaluation 
of F (or equivalently, the cost of a single evaluation of XF, yo )- 

One other factor that needs to be mentioned is the success rate of Grover's algorithm. 
If there is at most one solution to F(x) — yo, then Grover's algorithm will find that solution 
with probability 1 — e where e = 0(\X\ ). 

It is important to note that applying Grover's algorithm in a naive way to the problem 
of finding a matching pair of elements in two lists each of size A doesn't really yield 
much of a win. If the lists are both of size A, then checking all A 2 possible pairs of 
elements for a match with Grover's algorithm would require 0(V A 2 ) = 0(A) steps. This 
should be compared to the classical algorithm described above, which has an overall cost 
of O (A log A). Other than in improvement by a logarithmic factor, the only aspect in 
which Grover's algorithm really improves on the classical algorithm is in memory usage. 
The classical algorithm require a memory size of 0(A) to hold the sorted list (and to 
actually do the sorting), while Grover's algorithm doesn't really use any memory (other 
than what is used in computing single elements on each of the lists and comparing them) . 
Thus from a pure efficiency standpoint, a naive application of Grover's algorithm to the 
matching problem is not really better than what a classical algorithm would do. 

Quantum Matching 

If the first list L\ is sorted, and the Grover search only goes over the second list, 
the cost of sorting Li is O (A log A), and there will be no improvement over the classical 
solution. On the other hand, notice that Grover search over the second list costs 0(VN) 
insertions into sorted (£i), each at a cost of O(logA), and therefore the total cost of 
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Grover searching over the second list to find a match in sorted (Li) is 0(VN log N), 
which is dominated by the cost of the sort of L\. If it were possible to only do the sort on 
a set of size \/N instead of a set of size iV, then the cost of the sort would be 0(v^Vlog iV) 
also. 

This leads to the main idea. Suppose that the list Li is broken into blocks of size 
y/~N, there being \/N such blocks. If the block containing the match could be identified, 
then sorting this block and Grover searching over Li would find the matching element 
in time O (y/~N log iV) . The idea then is to identify the block containing the match with 
an outer application of Grover's algorithm as well. The inner algorithm just described is 
easily modified to return a simple yes/no answer on whether a block contains a match to 
an element in Li. Using this as an indicator function, the cost of running the outer Grover 
algorithm on the set of \[N blocks is 0(\fN) times the cost of computing this indicator 
function. The overall cost is therefore O (iV 3 / 4 log iV) . 

In this outer loop, the indicator function is computed quantum mechanically (and 
in fact in a superposition of basis states) , which implies doing the sort and inner search 
quantum mechanically, and then also undoing them once the yes/no answer has been 
computed, and in fact, this whole procedure is repeated 0(v / iV) times. This may seem a 
little bizarre, because the actual sort algorithm is now being performed on a superposition 
of lists, each list now comprising a single block of size V~N, and of course, the outcome of 
the comparison and conditional swaps occurring in each sort operation is also being done 
in superposition. Nevertheless, however classically nonsensical this is, the laws of quantum 
mechanics do not seem to forbid such activities. 



Error Analysis 

As originally conceived, Grover's algorithm is probabilistic in nature, with the correct 
result of a database search being produced with a probability very nearly equal to 1. 
However, in this Grover-within-Grover algorithm being proposed here, a more refined error 
analysis may be called for. 

Each inner application of the Grover algorithm is a search on L2 which is a list of length 
N, so the probability of any single inner Grover subroutine call succeeding is 1 — iV -1 / 2 . 
The outer Grover algorithm calls this subroutine OfiV 1 / 4 ) times, so the probability of all 

these calls returning the correct value for the indicator function is (1 — N J which 
is still very close to 1 for large N. Another way of seeing this is to consider the expected 
number of failures of the inner Grover subroutine call. Since the probability of failure 
of any single call is about 0(A^ _1 ) and this routine is call O^N 1 / 4 ) times, the expected 
number of failures is (9(iV _3//4 ) which is well below 1 for N large. 

In any case, single isolated failures of the inner Grover subroutine call are not going 
to be disastrous for the whole algorithm, since the indicator function for the correct block 
would simply return on all its arguments, which in turn would effectively mean that the 
outer Grover iteration would be run one less time than optimal. The result is that the 
correct answer will still be found to a very high probability. 
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Conclusion 



By allowing the Grover search algorithm to be called as a subroutine in conjunction 
with a sort on blocks inside of a larger Grover search, the block insider of L\ containing 
the putative match can be identified. At the end, this block then will be classically sorted, 
at cost OiN 1 / 2 log iV) and then L2 will be Grover searched for the match with an ele- 
ment of this block, again with cost O^N 1 ^ 2 log iV). The whole algorithm then takes time 
0(iV 3 / 4 log N). 

The memory requirement for this whole algorithm is iV 1 / 2 , and this has to be quantum 
memory since it is used in superposition for the inner subroutine call. For large values of 
N, this alone could be an obstacle, regardless of how many steps the whole algorithm must 
take and maintain quantum coherence. 

It is not known whether O ( iV 3 / 4 log N) is a lower bound on the matching problem for 
a quantum computer, but it pretty clearly beats the classical lower bound which has to 
be at least N, since the two lists themselves are that long. It is not known whether there 
are faster quantum sorting algorithms that might be employed, but it is known that faster 
than classical algorithms exist for insertion into a sorted list. The algorithm in [FGGS] is 
better than the best such classical algorithm, but only by a constant, and it is known from 
[A] that a constant speedup is the most that can be hoped for. 
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